Predicting gene function from patterns of annotation.

نویسندگان

  • Oliver D King
  • Rebecca E Foulger
  • Selina S Dwight
  • James V White
  • Frederick P Roth
چکیده

The Gene Ontology (GO) Consortium has produced a controlled vocabulary for annotation of gene function that is used in many organism-specific gene annotation databases. This allows the prediction of gene function based on patterns of annotation. For example, if annotations for two attributes tend to occur together in a database, then a gene holding one attribute is likely to hold the other as well. We modeled the relationships among GO attributes with decision trees and Bayesian networks, using the annotations in the Saccharomyces Genome Database (SGD) and in FlyBase as training data. We tested the models using cross-validation, and we manually assessed 100 gene-attribute associations that were predicted by the models but that were not present in the SGD or FlyBase databases. Of the 100 manually assessed associations, 41 were judged to be true, and another 42 were judged to be plausible.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting the Family Function based on Early Maladaptive Schemas and Couples Communication Patterns (Case Study: Education)

Purpose: The aim of this research was predicting the family function based on early maladaptive schemas and couple’s communication patterns. Methodology: Present study was descriptive from type of correlation. The research population was married female employees working in public and non-public schools of Tehran city and their spouses in 2017-2018 academic years. The research sample was 482 peo...

متن کامل

Exploiting ontology graph for predicting sparsely annotated gene function

MOTIVATION Systematically predicting gene (or protein) function based on molecular interaction networks has become an important tool in refining and enhancing the existing annotation catalogs, such as the Gene Ontology (GO) database. However, functional labels with only a few (<10) annotated genes, which constitute about half of the GO terms in yeast, mouse and human, pose a unique challenge in...

متن کامل

Primary root growth, tissue expression and co-expression analysis of a receptor kinase mutant in Arabidopsis

There is no functional annotation for the majority of the several hundreds of receptor-like kinases in plants. A direct way of inferring the function of these proteins is to study the phenotype that results from loss of function mutants such as T-DNA mutant lines. In this research a function (phenotype) to At2g37050 gene that encodes a receptor like kinase in Arabidopsis T-DNA line was...

متن کامل

Fast integration of heterogeneous data sources for predicting gene function with limited annotation

MOTIVATION Many algorithms that integrate multiple functional association networks for predicting gene function construct a composite network as a weighted sum of the individual networks and then use the composite network to predict gene function. The weight assigned to an individual network represents the usefulness of that network in predicting a given gene function. However, because many cat...

متن کامل

ESG: extended similarity group method for automated protein function prediction

MOTIVATION Importance of accurate automatic protein function prediction is ever increasing in the face of a large number of newly sequenced genomes and proteomics data that are awaiting biological interpretation. Conventional methods have focused on high sequence similarity-based annotation transfer which relies on the concept of homology. However, many cases have been reported that simple tran...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Genome research

دوره 13 5  شماره 

صفحات  -

تاریخ انتشار 2003